Melody Perception and Extraction from Audio
Abstract
Human beings have a very sophisticated sense of hearing. While the physiological aspects of the auditory system are well established, the perceptual and cognitive aspects are still not well understood. One active field of research in computer audition is automatic melody extraction from audio, with applications ranging from query-by-humming to genre classification and cover detection. On the cognitive side, research in melody perception has not been very active in the past 15 years, mainly because of the intrinsic difficulty of designing and conducting experiments. To understand how humans perceive melodies, we have to understand the mechanisms of pitch perception, stream segregation and melody perception. Only the first two have been researched extensively, while most research on melody perception has been confined to simple unaccompanied melodies. In this paper I describe the state of the art in melody perception and extraction from both cognitive and computational points of view. Some of the proposed algorithms for melody extraction model human hearing to a certain degree and exploit perceptual cues to improve accuracy, while others use signal processing and machine learning techniques with little or no regard for physiological or cognitive models. The proposed algorithms have achieved good results in some circumstances, but they are far from perfect. From this analysis it appears that perceptually motivated algorithms can improve accuracy, but future research in signal processing might find alternative ways to solve the problem.
Similar resources
Towards Computational Auditory Scene Analysis: Melody Extraction from Polyphonic Music
This paper describes an efficient method for the identification of the melody voice from the frame-wise updated magnitude and frequency values of tone objects. Most state-of-the-art algorithms employ a probabilistic framework to find the best succession of melody tones. Often such methods fail if there are several musical voices of comparable strength in the audio mixture. In this paper, w...
Audio Melody Extraction for Mirex 2009
This paper describes our submission to the audio melody extraction evaluation, which addresses the task of identifying the melody pitch contour from polyphonic musical audio. It gives an overview of the algorithm and a discussion of the evaluation results. The presented algorithm is a derivative of our submission to MIREX’06. Major changes between the two versions are highlighted and the impa...
Mirex2014: Audio Melody Extraction
This paper describes our submission for the audio melody extraction task of the Music Information Retrieval Evaluation eXchange (MIREX 2014). Our algorithm first separates the vocal spectra from polyphonic sound spectra. Melody extraction and vocal activity detection are applied to the separated spectra.
Optimizing Melodic Extraction Algorithm for Jazz Guitar Recordings Using Genetic Algorithms
Extraction of the main melody of a musical piece is a preliminary step in the process of transcribing the piece. Automatic melodic extraction is the task of computationally extracting what a human listener would perceive as the main melody of a polyphonic recording. Several melodic extraction systems have been proposed. However, such systems normally require a number of parameters to be manuall...
Melody Extraction in Music Audio Signals by Melodic Component Enhancement and Pitch Tracking
This extended abstract is for the “Audio Melody Extraction” contest of MIREX2009. We describe an algorithm that estimates the melody line from a music audio signal. The algorithm comprises two stages: melodic component enhancement and melody line tracking. Only a few researchers have used this approach because of the difficulty of melody enhancement. Our enhancement algorithm focuses on temp...